A methodology for designing semantic annotations

Author

  • Harry Bunt
Abstract

This paper presents a methodology for designing languages for semantic annotation. Central to this methodology is the specification of representation formats as renderings of conceptual structures defined by an abstract syntax as set-theoretic constructs. An ideal representation format is one that can represent all the conceptual distinctions made in the abstract syntax, and in which each representation encodes one and only one structure defined by the abstract syntax. The semantics of an annotation language is defined for its abstract syntax and is shared by all its representation formats; every ideal representation format is therefore convertible through a meaning-preserving mapping to any other ideal representation format. The methodology is called CASCADES after its four stages: Conceptual analysis, Abstract syntax, Semantics and Concrete syntax for Annotation DESign. The CASCADES model is useful because it supports a systematic design process for semantic annotations that gives due attention to the conceptual and semantic issues underlying choices in annotation formats. It includes procedures for moving from one stage of the design process to another, in particular for constructing an abstract syntax from a conceptual analysis, defining a semantics for a given abstract syntax, and mapping an abstract syntax to an XML-based concrete syntax. Three applications of the CASCADES methodology are discussed: (1) the design of an ISO standard for dialogue annotation starting from a conceptual analysis; (2) the analysis of existing annotation schemes such as those of the Penn Discourse Treebank and TimeML, as a basis for the development of ISO standards for semantic annotation; (3) the detection and repair of deficiencies in existing annotation schemes.
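The relation between an abstract syntax and its representation formats can be illustrated with a small sketch. The code below is not taken from the paper: the `EntityStructure` class, the `target` and `label` attribute names, and the two toy formats are hypothetical, chosen only to show what it means for two formats to each encode exactly one abstract structure and therefore to be convertible into one another without loss.

```python
import json
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass(frozen=True)
class EntityStructure:
    """Hypothetical abstract-syntax object: a markable paired with a label."""
    markable: str  # identifier of the annotated stretch of source text
    label: str     # conceptual category assigned to that stretch


def to_xml(s: EntityStructure) -> str:
    """Render the abstract structure in a toy XML-based concrete syntax."""
    elem = ET.Element("entity", {"target": s.markable, "label": s.label})
    return ET.tostring(elem, encoding="unicode")


def from_xml(text: str) -> EntityStructure:
    """Decode a toy XML representation back into the abstract structure."""
    elem = ET.fromstring(text)
    return EntityStructure(elem.attrib["target"], elem.attrib["label"])


def to_json(s: EntityStructure) -> str:
    """Render the same abstract structure in a second, JSON-based format."""
    return json.dumps({"target": s.markable, "label": s.label}, sort_keys=True)


def from_json(text: str) -> EntityStructure:
    """Decode a toy JSON representation back into the abstract structure."""
    data = json.loads(text)
    return EntityStructure(data["target"], data["label"])


def xml_to_json(text: str) -> str:
    """Convert between formats by passing through the abstract structure."""
    return to_json(from_xml(text))


if __name__ == "__main__":
    structure = EntityStructure(markable="m1", label="question")
    xml_form = to_xml(structure)
    # Each format decodes to exactly the structure it encoded.
    assert from_xml(xml_form) == structure
    assert from_json(xml_to_json(xml_form)) == structure
    print(xml_form)
    print(xml_to_json(xml_form))
```

Because both toy formats decode to the same abstract structure, any semantics defined on `EntityStructure` would apply to either rendering, which is the sense in which the conversion is meaning-preserving.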

Similar resources

Action Languages and Question Answering

This paper describes a methodology for designing Question Answering systems that utilize an action language ALM to allow inferences based on complex interactions of events described in texts. This methodology assumes the extension of the VERBNET lexicon with interpretable semantic annotations in ALM and specifies the use of several other NLP resources to produce ALM system descriptions for inpu...

Beyond Multimedia Integration: corpora and annotations for cross-media decision mechanisms

In this paper, we look into the notion of cross-media decision mechanisms, focussing on ones that work within multimedia documents for a variety of applications, such as the generation of intelligent multimedia presentations and multimedia indexing. In order for these mechanisms to go beyond the identification of semantic equivalence relations between media —which is what integration does— appr...

Treating metadata as annotations: separating the content markup from the content

The use of digital learning resources creates an increasing need for semantic metadata, describing the whole resource, as well as parts of resources. Traditionally, schemas such as Text Encoding Initiative (TEI) have been used to add semantic markup for parts of resources. This is not sufficient for use in a “metadata ecology”, where metadata is distributed, coherent to different Application Pr...

Semi-automated Semantic Annotation of Learning Resources by Identifying Layout Features

It is now widely accepted that any kind of digital content must be somehow semantically annotated to be intelligently used by computer programs. Annotations can be metadata, descriptions, etc. When dealing with learning, most systems require the author to manually annotate resources so that the system can deploy a navigation strategy, an adaptive behavior etc. However this task is very problema...

The Effects of Multimedia Annotations on Iranian EFL Learners’ L2 Vocabulary Learning

In our modern technological world, Computer-Assisted Language learning (CALL) is a new realm towards learning a language in general, and learning L2 vocabulary in particular. It is assumed that the use of multimedia annotations promotes language learners’ vocabulary acquisition. Therefore, this study set out to investigate the effects of different multimedia annotations (still picture annotatio...

Publication date: 2013